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(54) Titic: A CANCER DL\GNOSTIC METHOD BASED UPON DNA METHYLATION DIFFERENCES 



(57) Abstract 

There is disclosed a cancer diagnostic method based upon DNA methylation 
differences at specific CpG sites. As set forth in the Figure, the method comprises 
bisulfite treatment of DNA, followed by methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE) for detennination of strand-specific methylation status at cytosine 
residues. 
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A CANCER DIAGNOSTIC METHOD BASED UPON DN A METHYLATION 

DIFFERENCES 



5 Technical Field of the Invention 

The present invention provides a cancer diagnostic method based upon DNA 
methylation differences at specific CpG sites. Specifically, the inventive method provides for a 
bisulfite treatment of DNA, followed by methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE), for determination of strand-specific methylation status at cytosine 
10 residues. 

Background of the Invention 

Cancer treatments, in general, have a higher rate of success if the cancer is diagnosed 
early and treatment is started earlier in the disease process. The relationship between improved 
1 5 prognosis and stage of disease at diagnosis hold across all forms of cancer for the most part. 
Therefore, there is an important need to develop early assays of general tumorigenesis through 
marker assays that measure general tumorigenesis v^thoui regard to the tissue source or cell 
type that is the source of a primary tumor. Moreover, there is a need to address distinct genetic 
alteration pattems that can serve as a platform associated with general tumorigenesis for early 
20 detection and prognostic monitoring of many forms of cancer. 
Importance of DNA Methylation 

DNA methylation is a mechanism for changing the base sequence of DNA without 
altering its coding function. DNA methylation is a heritable, reversible and epigenetic change. 
Yet. DNA methylation has the potential to alter gene expression, which has profound 
25 developmental and genetic consequences. The methylation reaction involves flipping a target 
cytosine out of an intact double helix to -allow the transfer of a methyl group from S- 
adenosylmethionine in a cleft of the enzyme DNA (cystosine-5)-methyltransferase 
(Klimasauskas et al.. Cell 76:357-369, 1994) to form 5-methylcytosme (5-mCyt). This 
enzymatic conversion is the only epigenetic modification of DNA known to exist in vertebrates 
30 and is essential for normal embryonic development (Bird. Cell 70:5-8, 1 992; Laird and 

Jaenisch, Human Moi Genet. 3:1487-1495, 1994; and Bestor and Jaenisch, Cell 69:915-926, 
1992). The presence of 5-mCyt at CpG dinucleotides has resulted in a 5-fold depletion of this 
sequence in the genome during vertebrate evolution, presumably due to spontaneous 
deamination of 5-mCyt to T (Schoreret et al., Proc, Natl. Acad Sci. USA 89:957-961. 1992). 
35 Those areas of the genome that do not show such suppression are referred to as "CpG islands" 
(Bird. Nature 321 :209-213, 1986; and Gardiner-Garden et al., J. Mol BioL 196:261-282, 
1987). These CpG island regions comprise about 1% of vertebrate genomes and also account 
for about 1 5% of the total number of CpG dinucleotides (Bird, Infra.). CpG islands are 
typically between 0.2 to about I kb in length and are located upstream of many housekeeping 
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and tissue-specific genes, but may also extend into gene coding regions. Therefore, it is the 
methylation of cytosine residues within CpG islands in somatic tissues, which is believed to 
affect gene function by altering transcription (Cedar, Cell 53:3-4, 1988). 

Methylation of cytosine residues contained within CpG islands of certain genes has 
5 been inversely correlated with gene activity. This could lead to decreased gene expression by a 
variety of mechanisms including, for example, disruption of local chromatin structure, 
inhibition of transcription factor-DN A binding, or by recruitment of proteins which interact 
specifically with methylated sequences indirectly preventing transcription factor binding. In 
other words, there are several theories as to how methylation affects mRNA transcription and 
10 gene expression, but the exact mechanism of action is not well imderstood. Some studies have 
demonstrated an inverse correlation between methylation of CpG islands and gene expression, 
however, most CpG islands on autosomal genes remain unmethylated in the germline and 
methylation of these islands is usually independent of gene expression. Tissue-specific genes 
are usually unmethylated and the receptive target organs but are methylated in the germline and 
15 in non-expressing adult tissues. CpG islands of constitutively-expressed housekeeping genes 
are normally unmethylated in the germline and in somatic tissues. 

Abnormal methylation of CpG islands associated with tumor suppressor genes may also 
cause decreased gene expression. Increased methylation of such regions may lead to 
progressive reduction of normal gene expression resulting in the selection of a population of 
20 cells having a selective growth advantage {i.e., a malignancy). 

It is considered that altered DNA methylation patterns, particularly methylation of 
cytosine residues, cause genome instability and are mutagenic. This, presumably, has led to an 
80% suppression of a CpG methyl acceptor site in eukaryotic organisms, which methylate their 
genomes. Cytosine methylation further contributes to generation of polymorphism and germ- 
25 line mutations and to transition mutations that inactivate tumor-suppressor genes (Jones, 

Cancer Res. 56:2463-2467, 1996). Methylation is also required for embryonic development of 
mammals (Bestor and Jaenisch, Cell 69:915-926, 1992). It appears that that the methylation of 
CpG-rich promoter regions may be blocking transcriptional activity. Therefore, there is a 
probability that alterations of methylation are an important epigeneiic criteria and can play a 
30 role in carcinogenesis in general due to its function of regulating gene expression. Ushijima el 
al. (Proc. Nati Acad. ScL USA 94:2284-2289. 1997) characterized and cloned DNA fragments 
that show methylation changes during murine hepatocarcinogenesis. Data fi'om a group of 
studies of altered methylation sites in cancer cells show that it is not simply the overall levels 
of DNA methylation that are altered in cancer, but changes in the distribution of methyl 
35 groups. 

These studies suggest that methylation. at CpG-rich sequences known as CpG islands, 
provide an alternative pathway for the inactivation of tumor suppressors, despite the fact that 
the supporting studies have analyzed only a few restriction enzyme sites without much 
knowledge as to their relevance to gene control. These reports suggest that methylation of 
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CpG oligonucleotides in the promoters of tumor suppressor genes can lead to their inactivation. 
Other studies provide data that suggest that alterations in the noimai methylation process are 
associated with genomic instability (Lengauer et al. Proc. Natl Acad, ScL USA 94:2545-2550, 
1997). Such abnormal epigenetic changes may be found in many types of cancer and can, 
5 therefore, serve as potential markets for oncogenic transformation, provided that there is a 
reliable means for rapidly determining such epigenetic changes. The present invention was 
made to provide such a universal means for determining abnormal epigenetic changes and 
address this need in the art. 
Methods to Determine DNA Methylation 
10 There is a variety of genome scanning methods that have been used to identify altered 

methylation sites in cancer cells. For example, one method involves restriction landmark 
genomic scanning (Kawai et al.. MoL Cell. BioL 14:7421-7427, 1994), and another example 
involves methylation-sensitive arbitrarily primed PGR (Gonzalgo et al.. Cancer Res. 57:594- 
599, 1997). Changes in methylation patterns at specific CpG sites have been monitored by 
1 5 digestion of genomic DNA with methylation-sensitive restriction enzymes followed by 
Southem analysis of the regions of interest (digestion-Southern method). The digestion- 
Southern method is a straightforward method but it has inherent disadvantages in that it 
requires a large amount of DNA (at least or greater than 5 |ig) and has a limited scope for 
analysis of CpG sites (as determined by the presence of recognition sites for methylation- 
20 sensitive restriction enzymes). Another method for analyzing changes in methylation patterns 
involves a PCR-based process that involves digestion of genomic DNA with methylation- 
sensitive resuiction enzymes prior to PCR amplification (Singer-Sam et al., Nuci Acids Res, 
1 8:687,1990). However, this method has not been shown effective because of a high degree of 
false positive signals (methylation present) due to inefficient enzyme digestion of 
25 overamplificaiion in a subsequent PCR reaction. 

Genomic sequencing has been simplified for analysis of DNA methylation patterns and 
5-methylcytosine distribution by using bisulfite treatment (Frommer et al., Proc, Nad Acad, 
ScL USA 89:1827-1831, 1992). Bisulfite treatment of DNA distinguishes methylated from 
unmethylated cytosines, but original bisulfite genomic sequencing requires large-scale 
30 sequencing of multiple plasmid clones to determine overall methylation patterns, which 

prevents this technique from being commercially useful for determining methylation patterns 
in any type of a routine diagnostic assay. 

In addition, other techniques have been reported which utilize bisulfite treatment of 
DNA as a starting point for methylation analysis. These include methylation-specific PCR 
35 (MSP) (Herman et al. Proc Nati Acad, ScL USA 93:9821-9826, 1992); and restriction enzyme 
digestion of PCR products amplified from bisulfite-converted DNA (Sadri and Homsby, NucL 
Acids Res. 24:5058-5059. 1996; and Xiong and Laird, NucL Acids Res. 25:2532-2534, 1997). 

PCR techniques have been developed for detection of gene mutations (Kuppuswamy et 
al., Proc NatL Acad, ScL USA 88:1 143-1 147, 1991) and quantitation of alleUc-specific 
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expression (Szabo and Mann, Genes Dev. 9:3097-3108, 1995; and Singer-Sam el al., PCR 
Methods AppL 1:160-163, 1992). Such techniques use internal primers, which anneal to a 
PCR-generated template and terminate immediately 5' of the single nucleotide to be assayed. 
However an allelic-specific expression technique has not been tried within the context of 
5 assaying for DNA methylation patterns. 

Therefore, there is a need in the art to develop improved diagnostic assays for early 
detection of cancer using reliable and reproducible methods for determining DNA methylation 
patterns that can be performed using familiar procediires suitable for widespread use. This 
invention was made to address the foregoing need. 

10 

Summary of the Invention 

The present invention provides a method for determining DNA methylation patterns at 
cytosine sites, comprising the steps of: 

(a) obtaining genomic DNA from a DNA sample to be assayed; 
1 5 (b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 

cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methylation analysis; 

(c) performing a PCR amplification procedure using the top strand or bottom strand 
20 specific primers; 

(d) isolating the PCR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, [^^PjdNTPs 
and Tag polymerase, wherein the Ms-SNuPE primers comprise from about a 15 mer to about a 
22 mer length primer that terminates immediately 5' of a single nucleotide to be assayed; and 

25 (f) determining the relative amount of methylation at CpG sites by measuring the 

incorporation of different '-P-labeled dNTPs. 

Preferably, the [^^P]NTP for top strand analysis is [^-P]dCTP or [^'P]TTP. Preferably, 
the ["-P]NTP for bottom strand analysis is ["PjdATP or [^^P]dGTP. Preferably, the isolation 
step of the PCR products uses an electrophoresis technique. Most preferably, the 

30 electrophoresis technique uses an agarose gel. Preferably, the Ms-SNuPE primer sequence 
comprises a sequence of at least fifteen but no more than twenty five, bases having a sequence 
selected from the group consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 
[SEO ID NO. 3], HuNl [SEQ ID NO. 51, HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO, 71, 
HuN4 [SEQ ID NO. 8], HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 91, CaSl [SEQ ID NO. 

35 101, CaS2 [SEQ ID NO. 11], CaS4 [SEQ ID NO. 12], and combinations thereof 

The present invention further provides a Ms-SNuPE primer sequence designed to 
anneal to and terminate immediately 5' of a desired cytosine codon in the CpG target site and 
that is located 5' upstream from a CpG island and are frequently hypermethylated in promoter 
regions of somatic genes in malignant tissue. Preferably, the Ms-SNuPE primer sequence 
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comprises a sequence of at least fifteen bases having a sequence selected from the group 
consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl 
[SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], 
HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 

5 11], CaS4 [SEQ ID NO. 1 2], and combinations thereof. The present invention further provides 
a method for obtaining a Ms-SNuPE primer sequence, comprising finding a hypermethyiated 
CpG island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5' upstream from the hypermethyiated CpG island, and isolating a 15 to 
25 mer sequence 5' upstream from the hypermethyiated CpG island for use as a Ms-SNuPE 

10 primer. The present invention further provides a Ms-SNuPE primer comprising a 15 to 25 mer 
oligonucleotide sequence obtained by the process comprising, finding a hypermethyiated CpG 
Island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5* upstream from the hypennethylated CpG island, and isolating a 15 to 
25 mer sequence 5' upstream from the hypermethyiated CpG island for use as a Ms-SNuPE 

15 primer. 

Brief Description of the Drawings 

Figure 1 shows a diagram of the inventive Ms-SNuPE assay for determination of 
strand-specific methylation status at cytosines. The process involves treating genomic DNA 

20 with sodium bisulfite, and generating a template by a PCR technique for a top strand 

methylation analysis. Alternatively a bottom strand methylation can also be assayed by 
designing the appropriate primers to generate a bottom strand-specific template. The process 
further entails amplifying the templates by a PCR technique. The PCR products are 
electrophoresed and isolated from agarose gels, followed by incubation with Ms-SNuPE 

25 primers, as disclosed herein wherein the Ms-SNuPE primers comprise a from about a 15 mer to 
about a 25 mer length primer that terminates immediately 5' of a single nucleotide to be 
assayed, and PCR buffer, [^-P]dNTPs and Tag polymerase for primer extension reactions. The 
radiolabeled products are separated, for example, by electrophoresis on polyacrylamide gels 
under denaturing conditions and visualized by exposure to autoradiographic film or 

30 phosphorimage quantitation. 

Figure 2 shows the results from a quantitative methylation analysis of three top strand 
CpG sites from a 5* CpG island of pi 6. is a known tumor suppressor gene and the 
particular region examined for changes in methylation is the promoter region of this gene. The 
top panel provides the locations of three sites analyzed (numbered 1 , 2 and 3) relative to the 

35 putative transcriptional start sites (vertical arrows pointing upwards) and the exon la coding 
domain. The PCR primers used for top strand amplification of the 5' region of pi 6 (which 
includes putative transcriptional start sites) were 5'-GTA GGT GGG GAG GAG 111 AGT T- 
3* [SEQ ID NO. 13] and 5'-TCT AAT AAC CAA CCA ACC CCT CC-3' [SEQ ID NO. 14]. 
The control sets included "M" PCR product amplified from a plasmid containing bisulfide- 
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specific methylated sequence; "U" PCR product amplified firom a plasmid containing bisulfite- 
specific unmethyiated sequence; and "mix" a 50:50 mixture of methylated and unmethylated 
PCR-amplified plasmid sequences. The DNA samples analyzed included T24 and J82 bladder 
cancer cell lines; wbc (white blood cell), melanoma (primary melanoma tumor tissue sample), 
5 and bladder (primary bladder timior tissue sample). The tissue samples were micro dissected 
from paraffin-embedded tumor material. The grid at the bottom of the lower panel shows the 
ratio of methylated (C) versus unmethylated (T) bands at each site based upon phosphorimage 
quantitation. 

Figure 3 shows a mixing experiment showing a linear response of the inventive Ms- 

1 0 SNuPE assay for detection of cytosine methylation. A T24 bladder cancer cell line DNA 

(predominantly methylated) was added in increasing amounts to a J82 bladder cancer cell line 
DNA (predominantly unmethylated). Figure 3 shows data firom an 18 mer oligonucleotide 
[SEQ ID NO. 16] which was used in multiplex analysis of CpG methylation (site 2) of the pi 6 
5'CpG in combination with a 15-mer and 21 -mer primer [SEQ ID NOS 17 and 15, 

15 respectively] (correlation coefficient =0.99). Both the 15 mer and 21 -mer produced a nearly 
identical linear response as the 1 8-mer. Figure 3 shows data fi:*om three separate experiments. 

Figure 4 shows a schematic diagram that outlines a process for a high-throughput 
methylation analysis. The Ms-SNuPE primer extension reactions are performed and then the 
products are directly transferred to membranes, preferably nylon membranes. This allows for a 

20 large number of samples to be analyzed simultaneously in a high-density format. The 

membrane is washed and exposed to a phosphorimage cassette for quantitative methylation 
analysis and eliminate the need for polyacrylamide gel electrophoresis for data measurement. 

Figure 5 (Panel A) shows results from quantitative analysis of DNA methylation using 
the Ms-SNuPE blot transfer technique of Figure 4. Levels of DNA methylation in matched 

25 normal and tumor colon specimens were analyzed in the 5' promoter region of the pi 6 gene. 
The average methylation of 3 sites in the pl6 promoter (Figure 2) was determined by 
quantitating the C:T signal ration by phosphorimage analysis. Panel B shows the results of 
quantitating the average methylation of 3 CpG sites using standard polyacrylamide gel 
electrophoresis compared to dot blot transfers. The average methylation of the monitored sites 

30 in various colon specimens is plotted on the graph and shows little difference between 

quantiiated values derived from polyacrylamide gel electrophoresis compared torn the dotblot 
technique. These data show the feasibility of using the Ms-SNuPE dotblot procedure for high- 
ihroughput detection and quantitation of DNA methylation changes in cancer cells. 

35 Detailed Description of the invention 

The present invention provides a method for determming DNA methylation patterns at 
cytosine sites, comprising the steps of: 
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(a) obtaining genomic DNA from a DNA sample to be assayed, wherein sources of 
DN A include, for example, cell lines, blood, sputum, stool, urine, cerebrospinal fluid, paraffin- 
embedded tissues, histological slides and combinations thereof; 

(b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 

5 cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methylation analysis; 

(c) performing a PGR amplification procedure using the top strand or bottom strand 
specific primers; 

1 0 (d) isolating the PGR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, [^-PJdNTPs 
and Taq polymerase, wherein the Ms-SNuPE primers comprise a firom about a 1 5 mer to about 
a 22 mer length primer that terminates immediately 5* of a single nucleotide to be assayed: and 

(f) determining the relative amount of allelic expression of GpG methylated sites 
1 5 by measuring the incorporation of different ^^P-labeled dNTPs. 

Preferably, the [''P]NTP for top strand analysis is ['^P]dGTP or p-P]TTP. Preferably, 
the [^-P]NTP for bottom strand analysis is [^^PJdATP or [^^P]dGTP. Preferably, the isolation 
step of the PGR products uses an electrophoresis technique. Most preferably, the 
electrophoresis technique uses an agarose gel. 

20 DNA is isolated by standard techniques for isolating DNA from cellular, tissue or 

specimen samples. Such standard methods are found in textbook references such as Fritsch 
zndM^mdXis^ds,, Molecular Cloning: A Laboratory Manual^ \9%9. 

The bisulfite reaction is performed according to standard techniques. For example and 
briefiy, approximately 1 microgram of genomic DNA (amount of DNA can be less when using 

25 micro-dissected DNA specimens) is denatured for 1 5 minutes at 45 °C with 2N NaOH 

followed by incubation with O.IM hydroquinone and 3.6M sodium bisulfite (pH 5.0) at 55 °C 
for 12 hours (appropriate range is 4-12 hours). The DNA is then purified from the reaction 
mixture using standard (commercially-available) DNA miniprep columns, or other standard 
techniques for DNA purification are also appropriate. The purified DNA sample is 

30 resuspended in 55 microliters of water and 5 microliters of 3N NaOH is added for a 

desulfonation reaction, preferably performed at 40 °C for 5-10 minutes. The DNA sample is 
then ethanol-precipitaied and washed before being resuspended in an appropriate volume of 
water. Bisulfite treatment of DNA distinguishes methylated from unmethylated cyiosines. 
The present bisulfite treatment method has advantages because it is quantitative, does not use 

35 restriction enzymes, and many GpG sites can be analyzed in each primer extension reaction by 
using a multiplex primer strategy. 

The PGR amplification step (c) can be performed by standard PGR techniques, 
following a manufacturer's instructions. For example, approximately 1-2 microliters of the 
bisulfite-treated DNA was used as a template for strand-specific PGR amplification in a region 
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of interest. In a PCR reaction profile for amplifying a portion of the pi 6 5* CpG island, for 
example, a procedure of initial denaturation of 94 °C for 3 minutes followed by a cycle of 94 
T of 30 seconds, 68 °C for 30 seconds, 72 °C for 30 seconds for a total of 30 cycles. The 
PCR reactions were performed in 25 microliter volumes under conditions of: —50 ng bisulfite- 

5 converted DNA (less for micro dissected samples), 10 mM Tris-HCl (pH 8.3), L5 mM MgCU, 
50 mM KCl, 0.1% gelatin/ml, 100 of each of dNTP, 0.5 final concentration of each 
primer and 1 unit of Tag polymerase. There are many chromatographic techniques that can be 
used to isolate the PCR amplification products. In one illustrative procedure, approximately 
10-25 microliters of the amplified PCR products were loaded onto 2% agarose gels and 

10 electrophoresed. The bands were visualized and isolated using standard get purification 
procedures. 

The primer extension reaction is conducted using standard PCR primer extension 
techniques but using Ms-SNuPE primers as provided herein. Approximately 10-50 nanograms 
of purified PCR template is used in each Ms-SNuPE reaction. A typical reaction volume is 

1 5 about 25 microliters and comprises PCR template (about 10-50 ng), IX PCR buffer, 1 ^iM of 
each Ms-SNuPE primer, 1 |iCi of the appropriate ^^P-labeled dNTP (either [^-P]dCTP, 
["P]TTP, [^-P]dATP, [^-P]dGTP or combinations thereof), and 1 unit of Tag polymerase. As a 
general rule, oligonucleotides used in the primer extension reactions were designed to have 
annealing temperatures within 2-3 °C of each other and did not hybridize to sequences that 

20 originally contained CpG dinucleoiides. The Ms-SNuPE reactions were performed at 95 °C 

for 1 minute, 50 °C for 2 minutes, and 72 °C for 1 minute. A stop solution (10 microliters) was 
added to the mixtures to terminate the reactions. The inventive Ms-SNuPE assay utilizes 
internal primer(s) which anneal to a PCR-generated template and terminate immediately 5' of 
the single nucleotide to be assayed. A similar procedure has been used successfijlly for 

25 detection of gene mutations Kuppuswamy et al., Proc. Natl. Acad. Sci. USA 88: 1143-1 147, 
1991) and for quantitation of allele-specific expression (Szabo and Mann, Genes Dev. 9:3097- 
3108. 1995 and Greenwood and Burke. Genome Res. 6:336-348, 1996). . 

There are several techniques that are able to determine the relative amount of 
methylation at each CpG site, for example, using a denaturing polyacrylamide gel to measure 

30 '"P through phosphorimage analysis, or transfer of Ms-SNuPE reaction products to nylon 
membranes, or even using fluorescent probes instead of a "P marker. In one method for 
determining the relative amount of methylation at each CpG site, approximately 1-2 microliters 
of each Ms-SNuPE reaction product was electrophoresed onto 15% denaturing polyacrylamide 
gel (7M urea). The gels were transferred to filter paper and then dried. Phosphorimage 

35 analysis was performed to determine the relative amount of radiolabeled incorporation. An 
alternative method for determining the relative amount of methylation at individual CpG sites 
is by a direct transfer of the Ms-SNuPE reaction products to nylon membranes. This technique 
can be used to quantitate an average percent methylation of multiple CpG sites without using : 
polyacrylamide gel electrophoresis. High-throughput methylation analysis was performed by 
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direct transfer of the Ms-SNuPE reactions onto nylon membranes. A total of 100 microliters or 
.0.4 mM NaOH, 1 mM Na4P207 was added to the completed primer extension reactions instead 
of adding stop solution. The mixture was directly transferred to nylon membranes using a 
dotbloi vacuum manifold in a 96 well plate format. Each vacuum transfer well was washed a 

5 total of 4 times with 200 microliters of 2X SSC, 1 mM Na4P207. The entire membrane was 
washed in 2X SSC, 1 mM Na4P207. The radioactivity of each spot on the dried nylon 
membrane was quantitated by phosphorimaging analysis. 

In the inventive quantitative Ms-SNuPE assay, the relative amount of allelic expression 
is quantitated by measuring the incorporation of different ^^P-labeled dNTPs. Figure 1 outlines 

10 how the assay can be utihzed for quantitative methylaiion analysis. For example, the initial 
treatment of genomic DNA with sodium bisulfite causes unmethylated cytosine to be 
converted to uracil, which is subsequently replicated as thymine during PGR. Methylcytosine 
is resistant to deamination and is replicated as cytosine during amplification. Quantitation of 
the ratio of methylated versus unmethylated cytosine (C versus T) at the original CpG sites can 

1 5 be determined by incubating a gel-isolated PGR product, primer(s) and Taq polymerase with 
either ['-P]dCTP or [^^P]TTP, followed by denaturing polyacryiamide gel elecu-ophoresis and 
phosphorimage analysis. In addition, opposite strand (bottom strand) Ms-SNuPE primers are 
further designed which would incorporate either [^-P]dATP or [^^P]dGTP to assess methylation 
status depending on which GpG site is analyzed. 

20 Ms-SNuPE Primers 

The present invention further provides a Ms-SNuPE primer sequence designed to 
anneal to and terminate immediately 5' of a desired cytosine codon in the CpG target site and 
that is located 5' upstream from a GpG island and are frequently hypermethylated in promoter 
regions of somatic genes in malignant tissue. Preferably, the Ms-SNuPE primer sequence 

25 comprises a sequence of at least fifteen bases having a sequence selected from the group 
consisimg of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl 
[SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], 
HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 
11], GaS4 [SEQ ID NO. 12], and combinations thereof. The present invention further provides 

30 a method for obtaining a Ms-SNuPE primer sequence, comprising finding a hypermethylated 
CpG island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5' upstream from the hypermethylated CpG island, and isolating a 15 to 
25 mer sequence 5' upstream from the hypermethylated CpG island for use as a Ms-SNuPE 
primer. The present invention further provides a Ms-SNuPE primer comprising a 15 to 25 mer 

35 oligonucleotide sequence obtained by the process comprising, (a) identifying hypermethylated 
CpG islands a somatic gene from a malignant tissue or cell culture source, (b) determining the 
sequence located immediately 5* upstream from the hypermethylated CpG island, and (c) 
isolating at least a 15 mer sequence 5' upstream from the hypermethylated CpG island for use 
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as a Ms-SNuPE primer. Preferably the Ms-SNuPE primer sequence is from about 1 5 to about 
25 base pairs in length. 

The ability to detect methylation changes associated with oncogenic transformation is 
of critical importance in understanding how DNA methylation may contribute to 
tumorigenesis. Regions of DNA that have tumor-specific methylation alterations can be 
accomplished using a variety of techniques. This will permit rapid methylation analysis of 
specific CpG sites using the inventive quantitative Ms-SNuPE primer process. For example, 
techniques such as restriction landmark genomic scanning (RLGS) (Ha t a da et al., Proc. Natl. 
Acad ScL USA 88:9523-9527, 1995), methylation-sensitive-representational difference 
analysis (MS-RDA) (Ushijima et al., Proc. Nati Acad ScL USA 94:2284-2289, 1997) and 
methylation-sensitive arbitrarily primed PGR (AP-PCR) (Gonzalgo et al„ Cancer Res, 57: 594- 
599. 1997) can be used for identifying and characterizing methylation differences between 
genomes. 

Briefly, sequence determinations of regions of DNA that show tumor-specific 
methylation changes can be performed using standard techniques, such as those procedures 
described in textbook references such as Fritsch and Maniatis eds,. Molecular Cloning: A 
Laboratory Manual, 1989. Additionally, commercially available kits or automated DNA 
sequencing systems can be utilized. Once specific regions of DNA have been identified by 
using such techniques, the Ms-SNuPE primers can be applied for rapidly screening the most 
important CpG sites that are involved with the specific methylation changes associated with a 
cancer phenotype. ' 

Example 1 

This example illustrates a quantitative methylation analysis of three top strand siies in a 
5' CpG island of pi 6 in various DNA samples using the inventive method. The top panel 
provides the locations of three sites analyzed (numbered 1, 2 and 3) relative to the putative 
transcriptional start sites (venical arrows pointing upwards) and the exon la coding domain. 
The PGR primers used for top strand amplification of the 5' region of pi 6 (which includes 
putative transcriptional start sites) were 5'-GTA GGT GGG GAG GAG TTT AGT T-3' [SEQ 
ID NO. 13] and 5*-TCT AAT AAC CAACCA ACC CCT CC-3' [SEQ ID NO. 14]. The 
reactions were performed in 25 jil total volume under the conditions of 50 ng bisulfite-treated 
DNA. iO mM Tris-HCl (pH 8.3), 1.5 mM MgCK. 50 mM KCl, 0.1% gelatin/ml, 100 of 
each dNTP, 0.5 ^tM final concentration of each primer and lU of Tag polymerase (Boehinger 
Mannheim. Indianapolis, IN). The reactions were hot-started using a 1:1 mixture of 
TaqlTaqStari antibody (Clontech, Palo Alto, CA). 

An initial denaturaiion of 94 °C for 3 minutes was followed by 94 °C for 30 sec, 68 T 
for 30 sec. 72 °C for 30 sec for a total of 35 cycles. The PCR products were separated by 
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electrophoresis on 2% agarose gels and the bands were isolated using a Qiaquick™ gel 
extraction kit (Qiagen, Santa Clarita, CA). 

The Ms-SNuPE reaction was performed in a 25 ml reaction volume with 10-50 ng of PGR 
template incubated in a final concentration of 1 x PGR buffer, 1 ^lM of each Ms-SNuPE 
primer, 1 nCi of either [^^P]dCTP or [^-P]TTP and lU of Ta^ polymerase. The primer 
extensions were also hot-started using a 1: mixture of Taq/TaqSiart antibody. The primers 
used for the Ms-SNuPE analysis were: site 1 5'-TTT TTT TGT TTG GAA AGA TAT-3* [SEQ 
ID NO. 15]; site 2 5*-TTT TAG GGG TGT TAT ATT-3' [SEQ ID NO. 16]; site 3 S'-TTT GAG 
GGA TAG GGT-3' [SEQ ID NO. 17]. The conditions for the primer extension reactions were 
95 ^'C for 1 minute, 50 °C for 2 minutes and 70 for 1 minute. A stop solution (10 fil) was 
added to the reaction mixtures and the samples were loaded onto 1 5% denaturing 
polyacrylamide gels (7 M urea). Radioactivity of the bands was quantitated by 
phosphorimaging analysis. The control sets included "M" PGR product amplified from a 
plasmid containing bisulfide-specific methylated sequence; "U" PGR product amplified from a 
piasmid containing bisulfite-specific unmethylated sequence; and "mix" a 50:50 mixture of 
methylated and unmethylated PCR-amplified plasmid sequences. The DNA samples analyzed 
included T24 and J82 bladder cancer cell lines; wbc (white blood cell), melanoma (primary 
melanoma tumor tissue sample), and bladder (primary bladder tumor tissue sample). The 
tissue samples were micro dissected from paraffin-embedded tumor material. The grid at the 
bottom of the lower panel shows the ratio of methylated (C) versus unmethylated (T) bands at 
each site based upon phosphorimage quantitation. 

These data (Figure 2) show the ability of the inventive assay to detect altered patterns 
of methylation. 

Example 2 

This example illustrates a mixing experiment showing a linear response of the inventive 
Ms-SNuPE assay for detection of cytosine methylation. A T24 bladder cancer cell line DNA 
(predominantly methylated) was added in increasing amounts to a J82 bladder cancer cell line 
DNA (predominantly unmethylated). Figure 3 shows data from an 18 mer oligonucleotide 
[SEQ ID NO. 16] which was used in multiplex analysis of CpG methylation (site 2) of the pi 6 
5*CpG in combination with a 15-mer and 21 -mer primer [SEQ ID NOS 17 and 15, 
respectively] (correlation coefficient =0.99). Both the 1 5 mer and 21 -mer produced a nearly 
identical linear response as the 1 8-mer. Figure 3 shows data from three separate experiments. 
Differential specific activity and incorporation efficiency of each ["P]dNTP was controlled for 
by using a 50:50 mixture of bisulfite-specific methylated versus unmethylated PGR template 
for analysis. 

Example 3 

This example provides a summary of DNA regions for which Ms-SNuPE primers can 
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be designed and the inventive method applied for a quantitative detection of abnormal DNA 
methylation in cancer cells. The sequences are listed according to name, size and frequency of 
hypermethylation in the corresponding cell line or primary tumor. 



fragment 


size (bp) 


methylated in 
colon cell line 


hypermethylated 
in colon cancer 


hypermethylated 
in bladder cancer 


comments 


GaLl 


530 


7/7(100%) 


3/7 (42%) 


3/7 (42%) 


GC content (0.6), 
observed/expected 
CpG (0.63) 


GaL2 


308 


7/7(100%) 


4/5 (80%) 


6/7 (85%) 


GC content (0.6), 
observed/expected 
CpG (0.6) 


GaL4 


177 


7/7(100%) 


1/2(50%) 


Va (75%) 


GC content (0.59). 
observed/expected 
CpG (0.50) 


CaSl 


215 


4/7 (57%) 


0/5 (0%) 


2/7 (28%) 


GC content (0.55), 
observed/expected 
CpG (0.78) 


CaS2 


220 


4/7 (57%) 


3/5 (60%) 


3/7 (42%) 


GC content (0.54), 
observed/expected 
CpG (0.74) 


CaS4 


196 


6/7 (85%) 


0/5 (0%) 


1/7(14%) 


GC content (0.64), 
observed/expected 
CpG (0.84) 


HuNl 


148 


7/7(100%) 


3/5 (60%) 


; 3/7 (42%) ■ 


GC content (0.54), 
observed/expected 
LpQj (O.yy) 


HuN2 


384 


7/7(100%) 


4/5 (80%) 


2/7 (28%) 


GC content (0.6), 
observed/expected 
CpG (0.62) 


HuN3 


178 


6/7 (85%) 


4/5 (80%) 


3/7 (42%) 


GC content (0.53), 
observed/expected 
CpG (0.97) 


HuN4 


359 


7/7(100%) 


3/5 (60%) 


4/7 (57%) 


GC content (0.51), 
observed/expected 
CpG (0.47) 
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HuN5 


251 


7/7 (100%) 


2/5 (40%) 


5/7(71%) 


GC content (0.63), 
observed/expected 
CpG (0.77) 


HuN6 


145 


6/7 (85%) 


3/4 (75%) 


1/2 (50%) 


GC content (0.55), 
observed/expected 
CpG (0.47) 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANTS: Mark L. Gonzalgo and Peter A. Jones 

(ii) TITLE OF INVENTION: A CANCER DIAGNOSTIC METHOD BASED 
UPON DNA METHYLATION DIFFERENCES 

10 (iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIS WRIGHT TREMAINE 

(B) STREET: 2600 Century Square, 1501 Fourth Avenue 
15 (C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: U.S.A. 

(F) ZIP: 98101 

20 (V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: PC compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: Word 

25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: to be assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Oster, Jeffrey B. 

(B) REGISTRATION NUMBER: 32,585 

;C) REFERENCE /DOCKET NUMBER: 4 7675-2 

35 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 2C6 628 7711 

(B) TELEFAX: 206 628 7699 

40 (2) INFORMATION rOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 53 0 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: GaLl 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



1 CCCGCGACCT AAGCCAGCGA CTTACCACGT TAGTCAGCTA AGAAGTGGCA 50 

51 GAGCTGGGAT TCGAACCTAT AAAGAACTCT GAAGCCTGGG TATTTTTACA 100 

55 101 TGACACTTTA CATAATGCGC CACGGGGTAG TCGGAGGGGG AGGTCCATCT 150 

151 CCCTTTCCCT TGCTGTCCAT CTCCACAGAA AAGAAGCAAG TGGAGGACAG 2 00 

201 GAGCCAGAAA GTCATCTGGC CGCGGATCAT TCCGGAGTGA CCCCCGCCGC 2 50 

251 CACCACTCGC ATAGTCCGCT TATGGCGGGA GGGCACCTCA GAGATTCTCA" 3 0O 
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301 CAGGGGCTGT GCGGCCAGAA CCAGAAGTGC AAAGCACCGT TAGCGACTCT 350 
351 ATCGCCCCCT GCCGCCTGTG GCGCCCAGTC CGAAGCTGCT GTTTTCAGGA 400 
401 GGGCTAGTGG GCTAAGAAAA GAGCTCACCG ACTGACTGCC CAACAGCTGT 450 
451 TGCGAGCCAG TGCTAGGCTG CAGACAGCCT TGCCAAATGT GGTGACATAA 500 
5 501 GCGGGAGGGG GGAACATTTA GAGAGCCCTA 530 



(2) INFORMATION FOR SEQ ID NO : 2 : 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 08 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

15 

(ii) MOLECULE TYPE: GaL2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



20 1 CTAGGGTAGG CTGGTCTGTG CTGGATACGC GTGTTCTTCT GCGGAGTTAA 50 

51 AGGGTCGGGG ACGGGGGTTC TGGACTTACC AGAGCAATTC CAGCCGGTGG 100 

101 GCGTTTGACA GCCACTTAAG GAGGTAGGGA AAGCGAGCTT CACCGGGCGG 150 

151 GCTACGATGA GTAGCATGAC GGGCAGCAGC AGCAGCAGCC AGCAAAAGCC 200 

201 TAGCAAAGTG TCCAGCTGCT GCACTGCCGC GGGGACTCCC ACATCACCAT 250 

25 2 51 GACTAGTTGT GCAACTCTGC AGCAGAAACG GCTTCCGAGG AACACAGGAT 300 
3 01 CGCGGGGG 3 08 



30 



(2) INFORMATION FOR SEQ ID NO : 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: GaL4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

40 

1 GCTTCCTTTT TCTCGGCTTT CCTCACTATC CTCTCCCTGT TCGAGAGTAT 50 
51 CTCCACCAGC ACCGAGCCTC ACACGGGCTG TGCCTCCATC TTTGGAATGC 100 
101 CTACCCTTCT TTCTTGCGAA GCCCCTCCCA GC-GCCAGCCC TTGTGCACCG 150 
45 151 GCTCAAGGGG ACTGCTCTCC TGCCTCG 177 



(2) INFORMATION FOR SEQ ID NO : 4 : 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 3 

(B) TYPE: oliQonucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: HuNl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 

15 
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1 TTGCGCCGAT CGTCAAGAAC CTCTCATCCC TGGCAGCAGC AAAGCCAATA 50 

51 TATTTCCATT TCTTATTTCA GTTTGCCACC AAAACAAAGC TGCGCGCGGC 100 

101 TGAGGGCAGG AAGGCGCTGA GACCGACCGA GAAGAAGGGA CGTCCCGG 148 

5 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 84 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



15 



45 



55 



(ii) MOLECULE TYPE: HuN2 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



1 CAGGCCCGCC GAGACTCCAC TCCAACTACC AGGAAATTTC CCGTGGAGCT 50 

51 TCAATTCCTG GGACCCTCCT ACTGCGGGGA GAGTGGTTTC CCTGCCCCAC 100 

20 101 ACCATGCCCT AGGCCCGAGT CTGCGGCTCT TGGGGGATCT CTCCGAGCTC 150 

151 CGACACCGTG TTCGGACCGG GTGCGCCCTG CCGCTGGGGC TCAAGCCTGC 2 00 

201 AGGCGTGAGA ACCGGGGGAC TCTCTATGGC ACCAAGAGCT TCACCGTGAG 2 50 

251 CGTAGGCAGA AGCTTCGCTT TGATCCTAGG GCTTACAAAG TCCTCCTTTG 3 00 

301 GCTGCCCATG ATGGTAAAAG GGCAGTTGCT CACAAAGCGC GAGTGTGTGT 3 50 
25 351 GCCAGACAGT GTAAATGAGT GTTGGGACCG GCGT 3 84 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: HuN3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

1 GGGTCCGTTC GTGAATGCAT GAGCAGGGTG TGAGCGCCAG GGGGTTACAC 5 0 

40 51 TTCTCACGGG TTAAAACCCA GACAACTTCA CGAGGGAACC ACGTGCCATT 10 0 

101 TTAACAGCGT ACGGTCGGGA TCGTGGGACG TCATTAAACG GAGTGGGTTG 150 

151 AGTATGTGAC TCTGTCACCC ATTTTCTG 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 59 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 ( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: HuN4 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



1 CCCCGCGGGG CAGAATCCAA GTGAGTCAGA CACATTGCTC CCTCCCTGCT 50 

51 GCTGCCAGTC CATCTCTTTG CCAACAAACC TGCTTAAAAT GCCAAAGCTG 100 

101 GTCCAAAGTT TCAGGAAAAC AACTTCCGCC AGAGGGCACG TAGAGGGCAC 150 
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151 AGATGCTATA GATGCTTCTC TGACAAACAC TCCTGACCCC CTTGACAGAT 200 
201 TGGAAAATAC ATGGTTCAGA AAGGGTGAGA GATTTCAACT TGAGAAGTGA 250 
251 AACTAGGAAA AGATGGAAGG TGTCCGGATT TCTAGCTCAA GTCCACACAC 300 
3 01 TGCTTCTGCT GCGGTGACTA AATCGTGGCT GTGTTCTCAT CACCTGCCTC 3 50 
5 351 GCGGCGCGC 359 



(2) INFORMATION FOR SEQ ID NO : 8 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



15 



40 



50 



(ii) MOLECULE TYPE: H\iN5 primer 



1 GGCGGGCCTG GGCACCGCGG AGGGGGGGCT TTTCTGCGCC CGGCGAAGCG 50 

51 TGGAACTTGC GCCCTGAGGC AGCGCGGCGA GACCAGTCCA GAGACCGGGG 100 

20 101 CGAGCCTCCT CAGGATTCCT CGCCCCAGTG CAGATGCTGT GAGCTTAGAC 150 

151 GAGGACAGGG CATGGCACTC GGCTTGGCCC GTAGTGGACG GTGTTTTTGC 200 

201 AGTCATGAAC CCAAACGCCG CAAACCTTGA CCGTTTCCCC ACCCGTGTTG T 
251 

25 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: HuN6 primer 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

1 TGAGAGCAGC ATCCTCCCCT GCGTGTGGTT CTCTAACTTA CCTCCTGTAT 5 0 
51 GGGGTCTGCG GACCCAGCAC ACCTCCCGGG CCCCCAAAAA ATTCCAGCTC 100 
10 1 AAGAGCCCTA AAAATCCTTA CCCTGNNAAA GTTTGAGCTT CTCCC 145 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 215 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: CaSl primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



1 ACGCCGGCCA CAGTTCTTCA GTGAAACGCT TCACTCTCTG GTCATAGAGG 50 

51 TAGGAAACTA TAGCTGTCCC AACTAAATGT CAGGACGAAT TAGCCCAGCT 100 

55 101 GGTCACGCTC ACAGTCACCG CCTCCACCAG ACTGAGCGAC CCTCCCAACG 150 

151 GGGTTTGCCG TGTTGGGAGG ACAGCGGAGT TTCGTTGCTG TGTCAATTTG 2 00 
201 TGTAGACGCG GCTGC 215 
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10 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CaS2 primer 



1 CTGCTCTCTT CTCTTCTTTT CCCCTTTCCT CTCCTCTCCC TTTCCTCAGG 50 

51 TCACAGCGGA GTGAATCAGC TCGGTGGTGT CTTTGTCAAC GGGCGGCCAC 100 

101 TGCCGGACTC CACCCGGCAG AAGATTGTAG AGCTAGCTCA CAGCGGGGCC 150 

151 CGGCCGTGCG ACATTTCCCG AATTCTGCAG GTGATCCTCC CGGCGCCGCC 200 

15 201 CCACTCGCCG CCCCCGCGGC 220 



(2) INFORMATION FOR SEQ ID NO: 12: 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : unknown 

25 

(ii) MOLECULE TYPE: CaS4 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

30 1 GGGCGGCACG GAGGGAGTCA GGAGTGAGCC CGAAGATGGA GAGAAGTCGA 50 

51 TTCGCCCAGA GAACGCAAGA CGGTGGATCA GAGATGAGTC CCAGGAACCT 100 
101 CAGAGAGCGA GGCTGACAGG CCCGGGGAGA GGACCGGGCA GGGACAAACC 150 
151 AGCGGACAGA GCAGAGCGCG AAATGGTTGA GACCGGGAAG CGACCT 196 

35 (2) INFORMATION FOR. SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
1 5'-GTA GGT GGG GAG GAG TTT AGT T-3 ' 22 

(2) INFORMATION FOR SEQ ID NO : 14 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY ; unknown 
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(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
1 5'-TCT AAT AAC CAA CCA ACC CCT CC-3 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; Ms-SNuPE primer from pl6 promoter 
region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
1 5'-TTT TTT TGT' TTG GAA AGA TAT-3' 21 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) .TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

1 5' -TTT TAG GGG TGT TAT ATT- 3' 18 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
1 5' -TTT GAG GGA TAG GGT-3' 15 
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We claim: 

1 . A method for determining DNA methyiation patterns at cytosine sites, 
comprising the steps of: 

(a) obtaining genomic DNA from a DNA sample to be assayed; 

(b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 
cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methyiation analysis; 

(c) performing a PGR amplification procedure using the top strand or bottom strand 
specific primers; 

(d) isolating the PGR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, ["PJdNTPs 
and Taq polymerase, wherein the Ms-SNuPE primers comprise a firom about a 1 5 mer to about 
a 22 mer length primer that terminates immediately 5' of a single nucleotide to be assayed; and 

(f) determining the relative amoimt of allelic expression of GpG methylated sites 
by measuring the incorporation of different ^-P-labeled dNTPs. 

2. The method of claim 1 wherein the [^^P]dNTP for top strand analysis is 
['-P]dGTP or [''P]TTP. 

3. The method of claim 1 wherein the [^^P]dNTP for bottom strand analysis is 
[''P]dATP or ["P]dGTP. 

4. The method of claim 1 wherein the isolation step of the PGR products uses an 
electrophoresis technique. 

5. The method of claim 4 wherein the electrophoresis technique uses an agarose 

gel. 

6. The method of claim 1 wherein the Ms-SNuPE primer sequence comprises a 
sequence of at least fifteen but no more than twenty five bases having a sequence selected from 
the group consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], 
HuNl [SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 
8], HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], GaSl [SEQ ID NO. 10], CaS2 [SEQ ID 
NO. 1 1], GaS4 [SEQ ID NO. 12], and combinations thereof 

7. A Ms-SNuPE primer sequence designed to anneal to and terminate immediately 
5' of a desired cytosine codon in a CpG target site xomprising an oligonucleotide sequence of 
at least 15 base pairs and corresponding to a gene sequence located immediately 5' upstream 
from the CpG island that is frequently hypermethylaied in promoter regions of somatic genes 
in malignant tissue. 

8. The Ms-SNuPE primer sequence wherein the primer sequence is from about 1 5 
to about 25 base pairs in length and selected from the group consisting of GaLl [SEQ ID NO. 
1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl [SEQ ID NO. 5], HuN2 [SEQ ID 
NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], HuNS [SEQ ID NO. 8], HuN6 [SEQ 
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ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 11], CaS4 [SEQ ID NO. 12], and 
combinations thereof. 

9. A method for obtaining a Ms-SNuPE primer sequence designed to anneal to and 
terminate immediately 5* of a desired cytosine codon in the CpG target site, comprising finding 

5 a hypermethylated CpG island in a somatic gene from a malignant tissue or cell culture, 
determining the sequence located immediately 5* upstream from the hypermethylated CpG 
island, and isolating a 15 to 25 mer sequence 5' upstream from the hypermethylated CpG island 
for use as a Ms-SNuPE primer. 

10. A Ms-SNuPE primer comprising a 15 to 25 mer oligonucleotide sequence 

10 obtained by the process comprising, finding a hypermethylated CpG. island in a somatic gene 
from a malignant tissue or cell culture, determining the sequence located immediately 5* 
upstream from the hypermethylated CpG island, and isolating a 15 to 25 mer sequence 5' 
upstream from the hypermethylated CpG island for use as a Ms-SNuPE primer. 
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